Multi-objective optimization for clustering 3-way gene expression data

نویسنده

  • Doulaye Dembélé
چکیده

The microarray technology allows to monitor the expression level of thousands of genes simultaneously. A typical experiment will for example compare gene expression between multiple biological samples such as tumor biopsies, or a single sample in response to a treatment over time. It is assumed that genes with similar function or sharing regulatory elements will display a common expression profile over a variety of biological conditions. Hence, classification methods are used to group genes according to their expression profile in a defined set of samples and/or to group samples based on the set of genes they express [1, 4]. Unsupervised classification methods or clustering are typically used when no a priori information is available on samples or genes [7, 2]. In biomedical research, you may want to study simultaneously different drugs in different experimental conditions (concentration, time point, . . . ) on a biological model, leading to the generation on 3-way data. To search for interesting genes, clustering can be performed separately for each drug. Doing in this way does not take into account the interaction between drugs and samples comparison conditions. In this paper, we used the Fuzzy C-Means method for clustering 3-way gene expression data via optimization of multiple objectives. A reformulation of the total criterion is used to obtain an expression which has less number of variables compared to the traditional FCM criterion. This transformation allows to use a global optimizer, the Genetic Algorithm, to partition genes into clusters of similar profile. Let T be the number of test drugs or data level and K the number of groups to get from the dataset containing N genes. The total criterion to minimize is given by the following equation: J(uki, ck) = T ∑

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm

Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...

متن کامل

Improved Automatic Clustering Using a Multi-Objective Evolutionary Algorithm With New Validity measure and application to Credit Scoring

In data mining, clustering is one of the important issues for separation and classification with groups like unsupervised data. In this paper, an attempt has been made to improve and optimize the application of clustering heuristic methods such as Genetic, PSO algorithm, Artificial bee colony algorithm, Harmony Search algorithm and Differential Evolution on the unlabeled data of an Iranian bank...

متن کامل

Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis

Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...

متن کامل

An Approach to Reducing Overfitting in FCM with Evolutionary Optimization

Fuzzy clustering methods are conveniently employed in constructing a fuzzy model of a system, but they need to tune some parameters. In this research, FCM is chosen for fuzzy clustering. Parameters such as the number of clusters and the value of fuzzifier significantly influence the extent of generalization of the fuzzy model. These two parameters require tuning to reduce the overfitting in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Adv. Data Analysis and Classification

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2008